A New Over-sample Method Based on Distribution Density

نویسندگان

  • Kuoyi Shao
  • Yun Zhai
  • Haifeng Sui
  • Changsheng Zhang
  • Nan Ma
چکیده

A new method was proposed for leaning from the imbalanced dataset based the samples distribution density in this paper. In the proposed scheme, a model of samples distribution density was designed, followed by the improved smote progress SDD-SMOTE where we smoted the minority samples according to the samples distribution density. Cross-validation results show that proposed SDD-SMOTE method to some extent improves the minority prediction in both the recall and the precision metrics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sampling from social networks’s graph based on topological properties and bee colony algorithm

In recent years, the sampling problem in massive graphs of social networks has attracted much attention for fast analyzing a small and good sample instead of a huge network. Many algorithms have been proposed for sampling of social network’ graph. The purpose of these algorithms is to create a sample that is approximately similar to the original network’s graph in terms of properties such as de...

متن کامل

Optimal power flow based on gray wolf optimization algorithm using probability density functions extraction considering wind power uncertainty

In recent years, utilization of the renewable based power plants has become widespread in the power systems. One of the most widely used renewable based power plants is wind power plants. Due to the utilization of wind energy to generate electricity, wind turbines have not emitted any environmental pollution. Thus, in addition to economic benefits, utilization of these power plants is of great ...

متن کامل

The new generalized inverse Weibull distribution

Failure rate is one of the important concepts in reliability theory. In this paper, we introduce a new distribution function containing four parameters based on inverse Weibull distribution. This new distribution has a more general form of failure rate function. It is able to model five ageing classes of life distributions with appropriate choice of parameter values so that it is displayed decr...

متن کامل

Parameter Estimation of Some Archimedean Copulas Based on Minimum Cramér-von-Mises Distance

The purpose of this paper is to introduce a new estimation method for estimating the Archimedean copula dependence parameter in the non-parametric setting. The estimation of the dependence parameter has been selected as the value that minimizes the Cramér-von-Mises distance which measures the distance between Empirical Bernstein Kendall distribution function and true Kendall distribution functi...

متن کامل

Estimation of Density using Plotless Density Estimator Criteria in Arasbaran Forest

    Sampling methods have a theoretical basis and should be operational in different forests; therefore selecting an appropriate sampling method is effective for accurate estimation of forest characteristics. The purpose of this study was to estimate the stand density (number per hectare) in Arasbaran forest using a variety of the plotless density estimators of the nearest neighbors sampling me...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JCP

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2014